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ABSTRACT 

A procedure which may be used to project the 
frequency distribution of one test onto that of another test is 
described and illustrated. The procedure is useful when a test 
developer wishes to construct an alternate form with preferred 
distributional characteristics. For example, the test developer may 
wish to construct a new test form with a completely different set of 
items but which yield comparable measures of central tendency, 
dispersion, skewness and kurtosis, reliability (KR-20 , KR-21, Kappa), 
norms, pass-fail rates, etc. Rather than constructing alternate 
parallel tests, the test developer may wish to construct alternate 
forms which are substantially easier^or harder. The procedure is also 
applicable in that situation. The technique assumes that items on 
both tests have been previously calibrated and equated with the 
one-parameter 'Rasch model. The three-parameter logistic model may 
also be used when number-right rather than pattern scoring is used. A 
basic skills multiple choice test consisting of 21 items or 32 items 
provides the illustration. (Author/GDC) 
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ABSTRACT 

This paper describes and provides an example of a' procedure 
Which rnay .be used to project the frequency distribut ion -of one 
test on to that of another test.. The procedure is useful when, 
a test developer wishes to construct an alternate test form 
with preferred distributional characteristics. For example 
the test developer may wish to construct a neu test form 
with a completely different set of items but which yield 
comparable measures of central tendency, dispersion, skewness and 
kurtosis, reliability (KR-20, KR-21 , kappa), norms, pass-fail 
rates, etc. Rather than construct ing . alternate parallel tests 
the test developer may uish to construct alternate forms uhich '. 
are substantial]./ easier or harder. The procedure is also 
applicable in this situation'. The technique assumes that ' 
items on both tests have been previously calibrated and equated 
with .the one-parameter Rasch model ( the procedure may be easily 
extended to the three-parameter 1 ogist ic model when number-right " r 
r ather than pattern scoring is used ). 
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INTRODUCTION' ■ 

In this paper ue address the problem of estimating the 

observed score distribution of an alternate form of a test* 

The alternate form might be a lengthened form of the 

original test, or be composed of a compl etel y di f f erent set of 

i t ems . * . 

There are two common approaches to est imat ing observed score 

distributions. The first is with the negat i ve-iv/per geomet r ic 

distribution within the contest of the binomial error model and 

multiple-matrix sampling (Lord, 1962)* The second common method 

i£,*to- use item response t h ejpr y mode 1 s and assume that the 

conditional distribution of rau scores given .0 is a compound 

binomial distribut ion [usually estimated by a two-term 

approximation] (Lord, 1963» p; 524-526). The technique used in 

this paper also uses kfc^m response theory models. However, in the 

approach used here the conditional distribution of rau scores is 

calculated directly from the model. ' 
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Determining t he Con d i t 1 onal Frequency Distribution 

Given a test composed of m items ( each with known item 
parameters ) the conditional distribution- of the rau score X can 
be determined This is done by letting the vector J2 
denote the responses of an examinee with ability 8 to an m-item 
test. The response on_the ith item, Ui , is either 0 or 1. For 
the ith item the probability of Ui of an examinee with ability 9 
is 

P(Ui 18) = Pi Qi • d) 

where Qi = 1 - Pi . 

Since it is assumed that item responses are independent the 
probability of "any one of the 2™ observed vector of responses is 
equal. to the product of t he ■ ?r obab i 1 i t i es for each item repbnse.. 
Hence,- for an examinee with ability 9 the probability of the 
vec t or U i s 

PCUI8) = 77 Pi Qi (2 ) 



er|c i. 2 
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The condit ionfal distribution of , scores at each ability 
9, f(XI8), can nou be determined. Letting the raw score X equal 
Z Ui th.e conditional distribution of X given 9 is 

(?) r 

f(XI8) = £ 



K-l 



;■- < ' 



The subscript k is used to surfi each of the 



9 such that k = i 



subscript is a1uays(^J 
result i n X • 



' 2 (D * 



^ ^ J vectors for each 



The upper limit * of the 



because there are 



vectors oT U which 



Determining the Marginal Frequency Distribution 



Equations 2 and 3 may be used to rpake distributional 
projections from an old m-item test to a neu; n-item test. For an 
old m-item test let the frequencies at ability values of 80, 81, 
.., 9m, be NO, Ml,.-., Nm. Ue^are interested in estimating the 
frequencies NO, Nl,.., Nn on the neu n-item test at ability" 
values 90, 81,.., 8n. Letting Y indicate the rau score 
on the neu n-item test equation 3 can be used to obtain 

I 
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f(YI9) for values of 9 associated with the raw scores on 'the old 

m-item t set 'Taking the weighted mean of f ( Y 1 9) across the 

ability range from 90 to 8m will yield the projected marginal 

frequency distribution of raw scores f(Y). That is 

m 

f(Y) = £f <YI8j)Nj/N. . (4) 
^° 

nETHOD ' 

4 

The data for the example were obtained from a bank of 
calibrated items on a basic skills mul t ipl e-'choice test. During 
the field testing of the bank a 21 item form was field tested 
jv along with a 32 item form. Boi'-h forms Qere administered 
simultaneously to two random groups of examinees. There were no 
common items among the two forms nor common people between the 
random groups. The examinee performance on the 21 item test was 
used to predict the frequency distribution on the 32 item test. 
Since the 32 item test uas actual 1 y administered during the field 
testing it was possible to. compare its predicted frequency 
distribution to its actual frequency distribution. 
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RESULTS ' ' 

Table 1 shows the rau scores, ability estimates, and the 
predicted and actual frequency distributions on the 32 item test. 
A comparison of the predicted and actual frequency distributions 
reyeaT that they are very similar. In fact much of the 
discrepancy .between the two distributions would be eliminated if 
the actual distribution were smoothed ( nojte^ the procedure 
described in this paper can also be used for smoothing 
distribute on? ) . > 



Table 2 further indicates that the predicted and actual 
frequency distributions are similar. The means, standard 
deviations, and KR-20,s are virtually identical. 
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DISCUSSION ^ 

THis. paper described tJ~»e mathemat icaT basis and provided an 
empirical example of a method of estimating the frequency 
distribution Drt an alternate foVm'of a test. The empirical 
example was one in uhich performance on a 21-item test, was used 
4 to estimate derformance on a 32-item tpst. The tuo tests 
contained tuo different sets of items (from a calibrated bank) 
and uere administered to tuo different groups of examiae.es 
(from the same population). The r e su 1 t s" i nd i cat ed that the 
estimated distribution uas .sufficient 1 y close to the actual 
distribution for practical uork. 
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Predicted And Actual 



Raw 


Logit 


Score 


Ability 


0 


-5.148 


1 


• -4,074 


2 


-3.299 


3 


-2.814 


4 


-2.449 


5 


-2.149 


6 


-1.892 


7 *> 


-1.663 


8 


-1.454 


9 


-1.261 


10 


-1.079 


11 


-.906 


12 


-.740 


13 


-.579 


14 


-.421 


15 


-.266 


16 


-.112 


17 , 


.042 


18 


.196 


.19 


.352 


20 


.511 


21 


* .674 


22 


.843 


23 


1.020 


24 


1.207 


25 


1.408 


26 


1.628 


27 


1.874 


28 


2.154 


29 


2.508 


30 


2.973 


31 .1 


3.72$ 


32 


' 4.776 



Table 1 

Distribution For 32 Item Test 



Predicted Actual 
Frequency Frequency 



.21 


.00 


.50 


.00 


.71 


.50 


• 80 


.99 


.86 


.50 


1.00 


1.49 


1.26 . 


1.99 


1.60 


3.48 


1.95 


1.49 


2.26 


1.99 


2.53 


1.99 


2.79 


5.47 


3.06 


4.97 


3.33 


4.97 


3.58 


4.47 


3.80 


4.47 


3.98 * 


2.98 


4.14 


4.47 


4.28 


1.99 


4.39 


2.98 


4.45 


2.48 


4.46 


' 4.47 


4.44 


3.48 


4.42 


2.98 


4.38 


2.98 


H * JU 


2.98 




1.99 


3.91 


4.47 


3.62 


1.99 


3.37 


6.47 


3.37 


3.98 


3.88 


3.98 


4.19 


6.47 


100.00 


100.00 
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Table 2 



Descriptive Statistics For the Predicted and Actual 
Distribution of the 32 Item Test 



Descriptive Statistic Predicted Actual 

Mean Raw Score 19,72 19.31 

Standard Deviation Raw Score 7,62 8.26 

Mean Logit Ability ,68 .69 

Standard Deviation Logit Ability 1.66 1.84 

KR-21 r .so ' .92 
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